NXP Backend: Add imxrt700cm backend which combines the Neutron and CortexM backends#18488
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18488

❌ 2 New Failures, 1 Cancelled Job, 2 Unrelated Failures as of commit ea7648a with merge base 5e77594:
- NEW FAILURES: the following jobs have failed.
- CANCELLED JOB: the following job was cancelled; please retry.
- BROKEN TRUNK: the following jobs failed but were also present on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from dda9ddd to b45f3f7.
> training, the weights will be stored in the file.
> :param train: Boolean indicating whether to train the model.
> :param num_epochs: Number of epochs to use during training.
> :param cortex_m_safe: There is a bug in the Cortex-M backend related to the `pad` operator. If this parameter is
Let's fix this if it is something quick, as opposed to introducing new bypass logic? WDYT? CC @rascani since you were discussing this earlier this week in the context of NHWC.
Good point.
The issue with the pad operator is that calling the pad replacement op in `executorch/backends/cortex_m/passes/quantized_op_fusion_pass.py` (lines 410 to 415 in 28b4813) would produce a contiguous output even when the input had the channels-last dim order. I tried to find the root cause but couldn't, so I opted for the bypass and planned to report the issue after raising this PR.
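For illustration, the dim-order mismatch described above can be avoided by restoring the input's memory format on the output. This is only a sketch using the public `torch.nn.functional.pad`, not the actual replacement op in the fusion pass; the helper name is hypothetical.

```python
import torch
import torch.nn.functional as F

def pad_preserving_dim_order(x, pad, value=0.0):
    # Hypothetical sketch: remember whether the input was channels-last,
    # and restore that memory format on the padded output so downstream
    # nodes see the dim order they expect.
    channels_last = x.is_contiguous(memory_format=torch.channels_last)
    out = F.pad(x, pad, value=value)
    if channels_last and not out.is_contiguous(memory_format=torch.channels_last):
        out = out.contiguous(memory_format=torch.channels_last)
    return out

x = torch.randn(1, 3, 8, 8).to(memory_format=torch.channels_last)
y = pad_preserving_dim_order(x, (1, 1, 1, 1))
```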
Thank you!
I have removed the workaround from our testing model.
> from torchao.quantization.pt2e.quantizer.quantizer import Q_ANNOTATION_KEY
>
> class IMXRT700CMQuantizer(Quantizer):
Quantizers are meant to be composable. A recipe is the right user-facing abstraction for targeting an SoC with multiple different backends. Take a look at https://github.com/pytorch/executorch/blob/main/export/tests/test_target_recipes.py, especially something like get_android_recipe, to understand how two or more quantizers/partitioners are encapsulated and made to work together.
In your case, I imagine a target recipe for rt700 with neutron and cortex-m.
Thank you @digantdesai for the insights. I have looked into it, and recipes definitely look like the right way forward.
I analyzed the state in executorch:

- To introduce an SoC recipe would require having recipes for both the Neutron and Cortex-M backends (both currently missing). Alternatively, the current Cortex-M and Neutron pipelines could be combined into a single recipe, but from a reuse perspective a base recipe for each backend seems better in my opinion. Our Neutron backend pipeline is currently implemented in
- The Neutron pipeline contains some kernel registration functionality, as only it knows which NPU kernels are required. This would probably require the creation of a new `Stage` type, or at least I didn't find any stage providing the functionality to just execute a function based on the presence of an option.
- QAT appears not to be supported. The `QuantizeStage` explicitly states it performs post-training quantization. I see that the `SourceTransformStage` also enables quantization in some way, but it doesn't seem QAT is supported. So perhaps this would require another new `Stage` type (or a modification of an existing stage).
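The missing "run a function when an option is present" stage mentioned above could look roughly like the sketch below. This is purely hypothetical: `CallbackStage` and the `options` dict are illustrative names, not ExecuTorch APIs.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict

@dataclass
class CallbackStage:
    """Hypothetical stage that runs a callback only when a gating
    option is present, e.g. NPU kernel registration for Neutron."""
    option_key: str
    callback: Callable[[Any], Any]

    def run(self, artifact: Any, options: Dict[str, Any]) -> Any:
        # Execute the callback only if the gating option is set;
        # otherwise pass the artifact through unchanged.
        if options.get(self.option_key):
            return self.callback(artifact)
        return artifact

stage = CallbackStage("register_neutron_kernels", lambda artifact: artifact + 1)
```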
Given this, enabling the RT700 Neutron+Cortex-M backend via a recipe requires changes in multiple backends, and this PR would end up quite large. Can we do this in multiple stages? Such as:

- Experimentally, continue with this early implementation introducing the option to combine the Cortex-M and Neutron backends for the i.MX RT700.
- Rework the current Neutron lowering pipeline into a recipe, and do the same for the Cortex-M backend. Here we would potentially introduce new `Stage` types.
- Rework the `imxrt700cm` lowering into a recipe.
- Based on subsequent discussion, extend for QAT training.

For Cortex-M we need to sync with Arm too.
What is your opinion?
…ors. The operator `dim_order_ops._clone_dim_order.default` uses the `kwargs` to determine the output dim order. Since the `kwargs` were always empty, this operator produced an incorrect result in the pass, which broke the rest of the model.
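The kwargs-driven dim-order behavior described in that commit message can be illustrated with the public `Tensor.clone` `memory_format` kwarg, used here as a stand-in for the internal `_clone_dim_order` op (not the op itself):

```python
import torch

x = torch.randn(1, 3, 4, 4).to(memory_format=torch.channels_last)

# Default (preserve_format): the clone keeps the channels-last dim order.
same = x.clone()

# An explicit memory_format kwarg changes the output dim order; if the
# kwargs are dropped, the default behaviour silently applies instead,
# analogous to the bug fixed in the commit above.
cont = x.clone(memory_format=torch.contiguous_format)
```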
Force-pushed from b45f3f7 to ea7648a.
Summary
Add the imxrt700cm backend, which combines the Neutron and Cortex-M backends into one. The backend uses Neutron wherever possible, and the leftover nodes are handled by Cortex-M.
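The delegation order in the summary (Neutron wherever possible, leftovers to Cortex-M) can be shown with a toy partition assignment. This is not ExecuTorch code; the op sets and backend names are illustrative only.

```python
# Toy sets of supported ops, chosen for illustration only.
NEUTRON_SUPPORTED = {"conv2d", "linear", "relu"}
CORTEX_M_SUPPORTED = {"relu", "pad", "softmax"}

def assign_backends(ops):
    assignment = {}
    for op in ops:
        if op in NEUTRON_SUPPORTED:        # Neutron (NPU) wherever possible
            assignment[op] = "neutron"
        elif op in CORTEX_M_SUPPORTED:     # leftover nodes go to Cortex-M
            assignment[op] = "cortex_m"
        else:                              # otherwise fall back to portable kernels
            assignment[op] = "portable"
    return assignment

result = assign_backends(["conv2d", "pad", "relu"])
```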
Test plan
Unit tests provided
cc @robert-kalmar @JakeStevens @digantdesai